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Abstract 



We consider the optimal mass transportation problem in R d with measurably pa- 
rameterized marginals, for general cost functions and under conditions ensuring the 
existence of a unique optimal transport map. We prove a joint measurability result for 
this map, with respect to the space variable and to the parameter. The proof needs to 
p/ ^ ' establish the measurability of some set- valued mappings, related to the support of the 

p I , optimal transference plans, which we use to perform a suitable discrete approximation 

procedure. A motivation is the construction of a strong coupling between orthogonal 
martingale measures. By this we mean that, given a martingale measure, we construct 
in the same probability space a second one with specified covariance measure. This 
is done by pushing forward one martingale measure through a predictable version of 
the optimal transport map between the covariance measures. This coupling allows us 
to obtain quantitative estimates in terms of the Wasserstein distance between those 
covariance measures. 

Keywords: Measurability of optimal transport. Coupling between orthogonal mar- 
tingale measures. Predictable transport process. 
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q ; 1 Introduction 

We consider the optimal mass transportation problem in M. d with measurably parameterized 
marginals, for general cost functions and under conditions ensuring the existence of a unique 
optimal transport map. The aim of this note is to prove a joint measurability result for 
this map, with respect to the space variable and to the parameter. One of our motivations, 
developed at the end, is the construction of a strong coupling between martingale measures. 
That is, given a martingale measure, we shall construct in the same probability space a 
second one with specified covariance measure process. This will be done by pushing forward 
the given martingale measure through the optimal transport map between the covariance 
measures. To make this construction rigorous, we need the existence of a predictable version 
of this transport map, which will be a consequence of our main result. 
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We denote the space of Borel probability measures in R d by V(R d ), and by V p (R d ) the 
subspace of probability measures having finite p— order moment. 
Given vr G V(R 2d ), we write 

if fj,, v G P(R ) are respectively its first and second marginals. Such it is refereed to as a 

"transference plan" between \i and v. 

Let c : M. d — > R+ be a continuous function. The mapping 

7T — » 7(7r) := / c(x,y)ir(dx,dy) 

is then lower semi continuous. 

The Monge-Kantorovich or optimal mass transportation problem with cost c and marginals 
^ consists in finding 

MI (it). 

It is well known that the infimum is attained as soon as it is finite, see |13j . Ch.l. In this 
case, we denote by U*(/j,,u) the subset of T(R 2d ) of minimizers. If otherwise, /(vr) = +00 
for all 7T <„, then by convention we set II* (/x, v) = 0. 

We shall say that Assumption H(ijl,v, c) holds if 

a) [i does not give mass to sets with Hausdorff dimension smaller than or equal to d—1. 

b) there exists a unique optimal transference plan tt G H*(fj,,u), and it has the form 

ir(dx, dy) = fi(dx) ® b~ T {x){dy) 

for a [i(dx) — a.s. unique mapping T : M. d — > R . 

Such T is called an optimal transport map between [i and v for the cost function c. 
Hypothesis a) in H(fi, u, c) is optimal both for existence and uniqueness of an optimal 
transport map, see Remark 9.5 in [14j . We recall that if II* (/i, v) 7^ 0, a) implies b) in the 
following situations (see Gangbo and McCann [3]): 

i) c(x,y) = c(\x — y\) with c : M+ — > strictly convex, superlinear and differentiable 
with locally Lipschitz gradient. 

ii) c(x,y) = c(\x — y\) with c strictly concave, and [i and v are mutually singular. 
Condition b) also holds if 

iii) c(x, y) = c(\x—y\) with c strictly convex and superlinear, and moreover \x is absolutely 
continuous with respect to Lebesgue measure. 

When fj,, v G V p (M. d ), fundamental examples are the cost function c(x,y) = \x — y\ p with 
p > 2 for case i), p > 1 for case iii), and p G (0, 1) for case ii). 

Our main result is 
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Theorem 1.1. Let (E, E,m) be a a— finite measurable space and consider a measurable 
function A € E \— ► G V(M. d ) such that for m— almost every X, H{ix\,v\,c) holds, 

with optimal transport map T\ : M. d — > M d . Then, there exists a function (A, x) i— > T(A,x) 
which is measurable with respect to E ® £>(M rf ) cm<i suc/i f/mi m(dX)— almost everywhere, 

T(X,x) = T\{x) fi\(dx) -almost surely. 

In particular, T\(x) is measurable with respect to the completion of T> ® B(R d ) with respect 
to m(dX)fJLx(dx). 

Theorem 11,11 generalizes Theorem 1.2 in [3], where we constructed a predictable version of 
a quadratic transport map, between a time-varying law and empirical samples of it. 
To our knowledge, other measurability results on the mass transportation problem require 
a topological structure on the space of parameters, or concern transference plans but not 
transport maps (see e.g. [10], or Corollaries 5.22 and 5.23 in [T3]). 

The proof of Theorem II .11 is developed in the following section. We firstly establish a type 
of measurable dependence of the support of the optimizers on A. From this result, we can 
define measurable partitions of E x M. d induced by a dyadic partition of M. d , and construct 
bi-measurable discrete approximations of T(A,x). This approximation procedure was not 
needed in the simpler case studied in [3], where one of the marginals was an empirical 
measure (thus with finite support). 



2 Proof of Theorem 11.11 

Let us first state an intermediary result concerning measurability properties of minimizers 
in the general framework. Its formulation and proof require some notions of set-valued 
analysis, see e.g. Appendix A of [9]. 

Theorem 2.1. The function assigning to (//, u) the set of] 

v) := Adh\ [J supp(ir) 
Vren*(^) 

is measurable in the sense of set-valued mappings. That is, for any open set 9 in M. 2d , its 
inverse image ^(O) = {(//, v) G (V(R d )) 2 : v)f\6 / 0} is a Borel set in (V(R d )) 2 . 

Remark 2.2. In the case of a set-valued mapping taking closed-set values, measurability 
is equivalent to the fact that inverse images of closed sets are measurable (see 

Proof. The idea of the proof is similar to the one of Theorem 1.3 in [3], where we considered 
the quadratic cost and the measurable structure induced by the Wasserstein topology. In 
the present case, the spaces V(M. d ) and ^(IR 2 ^) are endowed with the usual weak topology. 
We observe that \£ writes as the adherence of a set-valued composition, 

v) = Adh (U o S(jjl, v)) := Adh j [J U(tt) ) , 

where S and U are the set-valued mappings respectively defined by 

S(fJ,, v) := v) and U(ir) := supp(ir). 
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Measurability of is equivalent to U o S being measurable. The latter will be true as soon 
as S is measurable and U~ 1 {9) is open for every open set 9 (see [9]). 

The stability theorem for optimal transference plans of Schachermayer and Teichman (The- 
orem 3 in [11]) exactly states that inverse images through S of closed sets in ^(R 2 ^) are 
closed sets in (V(M. d )) 2 . This, together with the fact that the mapping S takes closed-set 
values (by lower semi continuity of I (it)) imply that 5 is a measurable multi-application. 
On the other hand, the inverse image by U of an open set 9 of M. 2d is 

IT 1 (6) = {it G V(R 2d ) : supp(ir) n 6 + 0} = {vr G 7?(IR M ) : n(6) > 0}. 

It then follows by the Portmanteau Theorem that U~ 1 (9) is an open set in V{R 2d ), and 
this concludes the proof. 

□ 

Corollary 2.3. Let (E,T,) be a measurable space, and A £ E i— > v\) G (T^O^)) 2 a 
measurable function. We consider the function ^ defined by (QP and let F be a closed set 
ofR d . Then, the set 

{(\i):({i}xF)n%^)/l} 

belongs to S <g) B(M. d ). In particular, if for all A G E, U*(/j,\, u\) = {tt\\ is a singleton, the 
set 

F := {(A, x) : ({x} xF)n supp(ir x ) + 0} 

is measurable. 

Proof. Without loss of generality, we assume that F is nonempty. Let us first show that 
for any open set 6 of M. 2d , the set 

G={zeM. d : ({z} xF)n9/l} 

is open. Indeed, for x G G there exists y G F and e > such that B(x,e) x B(y,s) C 9. 
In particular, for all z G B(x,e) one has (z,y) G 9 and so B(x,e) C G. By definition of 
measurability, the set-valued mappings (A, x) — > {x} x i 7 and (A, x) — > ^(/^A; ^a) — ({^} x F) 
are thus measurable. The latter mapping being also closed valued, we conclude that 

| (A, x) : M»x,vx) ~ ( {*} x F)} n {0} ^ 

is a measurable set, which finishes the proof. 

□ 

Let us now focus on the proof of Theorem 11.11 

Proof of Theorem 11.11 Since any er-finite measure is equivalent to a finite one, we can 
assume without loss of generality that m is finite. 

For a fixed k > 1, we denote by (A ni k) n€ z d the partition of M. d in dyadic half-open rectangles 
of size 2~ dk , that is 

d 

A n , k := Yl 
i=i 



2T' n ' 2 t 1 ) ' where n = ( ni > ■■■> n d) e 
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Consider the sets B n ^ = {(A, x) G E x R d : ({x} x A n) fe) PI suppi^x) ^ 0}. Notice that since 



A n ,k — UjeN rii=i 



Hi rii + l 1 



2*s ) 2 k 2 k +3 



one has 



B n , k =\J l(\,x) : [{x}xH 
jeN { V 



i=l 



n; rij + 1 1 



2fc ' 2 fc 2 fc+ J 



n supp{-K\) / 



and so is measurable thanks to Corollary | 
Denote now by a n ^ k G A n k the "center" of the set, and define aS® measurable 
function by 

T fc (A,x)= Y, *n,kl Bnik (\x). (2) 

For each A G let vk be the discrete measure defined by pushing forward \i\ through T k , 
that is, 

v k x {A) = J l THKx)eA nx(dx), A G B(R d ). 



Denote also by E G £ a measurable set with m(E c ) = and such that for all \ <E E, 
H((j,x,v\,c) holds. 

By hypothesis, for each A G £7 we have that 



/iA(tfcc) almost surely: l Bn fc (A,a;) = I^t^sA^}- 
where T\ has been defined in the statement of Theorem II .li This implies that 

V\(i a n,k}) = / lB nk (\x:)n\(dx) = ^\{{x \ T\{x) G A n , k }) = V\(A n , k ) 



(3) 



by definition of T\. 

We now check that (T )kevi is a cauchy sequence in 

and for each n G Z d denote by {^4 n ',fe'}n' the unique partition of A nj fc in dyadic rectangles 
of size 2~ dk . We then have that 



\T k (X,x)-T k '(X 



Y E 1 B n , k ,{\x)\a ntk -a nl)k >\iix{dx)m{dX) 



- / E E « / A(A,',fc')"»(dA) 

E n& d n':A n , k ,CA n)k 

< I J2 2- k u x (A n>k )m(dX) 

k I v x (R d )m(dX) = 2- k m(E), 



and the Cauchy property follows since m{E) < oo. 
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Let us denote by T the limit in L l (E x M. d , m(d\)ij,\(dx)) of the sequence T k . Theorem 
11.11 will be proved by verifying that for all A in a set of £ of full m- measure set, one has 
ir\(dx,dy) = n\(dx)5x(\ t x)(dy)- Hence, it is enough to check that 

^CxA ntk {x,T(\,x))nx{dx) = vr A (C x A ntk ) 

for any semi-open rectangle C with dyadic extremes and all n G Z d , k G N. We have for 
A G E and any j G N that 



7T A (C x A nk ) - / lcxA nk (x,T(\,x))fi\(dx) 



< 



vr A (C x A n>fc ) - / l C xA nk {x,Ti{\x))iJ,x{dx) 



(4) 



+ 



lcxA n , k {x,T 3 {\,x)) - l C xA n , h (x,T(\,x)) fi x (dx) 



:=A, • A' . 

We approximate 1a„ fc by a Lipschitz continuous function / Ae such that ||/a, £ ||oo < 1 and 
H\{{y '■ fx,e(y) 1a„ k }) < e (this is possible thanks to H(/i\, v\, c), a)). Hence, the second 
term A^ on the r.h.s. of @ is bounded by 

4e + L x , £ J |T^'(A,x) -T(A,x)|/i A (dx), 

where L Ai£ is the Lipschitz constant of / Aj£ . Since J |T J (A,x) — T(A,x)| fj,\(dx) converges 
in L 1 (m(d\)) to 0, there is a subsequence T Ji and a set E G S of full measure such that 
the convergence holds for all A G E. Consequently, for all A G E := E D E we get that 

limsup Aj. = limsup / \l C xA nik (x,T jl (X,x)) - l C xA riik (x,T(X,x))\ fi\(dx) < 2s, 

i^oo i— >oo J 

and since the l.h.s. does not depend on e, this means that lim^oo A'-, = 0. 
The proof will be achieved be verifying that for fixed A G E, one has Aj = for all large 
enough j. For such A, fix a Borel set D\ of M. d of full /i A measure where (|3j) is everywhere 
true. Then, 



*-CxA nk 



{x,T 3 {\,x))ii x {dx) = j l^^OxA^ix^ 3 (\,x))nx{dx) 



S 



(D x nC)xA nik x, 



^d.v< •.. .i„ , I x , a mJ l Am ^(Tx(x)) I nx{dx). 



Remark now that for all j ' > /c, y G A n ^ < — > X^m:a m -6A n & 

li Bj (!/) G A n , k . Then, for 

all j > k, 



lcxA n:k (x,T J (\,x))nx(dx) = / l {DxnC)xAn k (x,Tx(x))nx(dx) = vr A (C x ^ Uife ) 



□ 
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3 Application: strong coupling for orthogonal martingale 
measures 



We now develop an application of Theorem (jl.ip . Let .Ft,P) be a filtered probability 

space and consider M an adapted orthogonal martingale measure on M + x M. d (in the 
sense of Walsh [E]). Assume that its covariance measure has the form qt(da)dkt, where 
qt(io,da) is a predictable random probability measure on W 1 with finite second moment 
and k t a predictable increasing process. Let us also consider another predictable random 
probability measure q t (u>,da) on M. d with finite second moment. 

We want to construct in the same probability space a second martingale measure with 
covariance measure qt(oJ,da)dkt, in such a way that in some sense, the distance between 
the martingale measures is controlled by the Wasserstein distance between their covariance 
measures. Recall that this distance is defined for fi, u € ) by 

Wi(n,v)= mi :/(tt) 

7T<£ 

with the quadratic cost 

c(x,y) = \x - y\ 2 . 

This distance makes the set T > 2^ d ) a Polish space, and strengthens the weak topology with 
the convergence of second moments (see [8]). 

Theorem 3.1. In the previous setting, assume moreover that ¥(du)dkt(uj) a.e. qt has a 
density with respect to Lebesgue measure in M. d . 

Then, there exists in (Cl, T , J-t,W) cl martingale measure M on M + xIR rf with covariance mea- 
sure qt(da)dkt, such that for all S > and for every predictable function <fi : Clx~R + xM. d — ► R 
that is Lipschitz continuous in the last variable with E ( J (p 2 (s, a) (q s (da) + q s (da)) dk s ^j < 
00, one has 

E ^sup J <t>(s,a)M(ds,da) - J J <p(s,a)M(ds,da) S j J < E (/ L 2 S W%(q s ,q s ) dk^j , 

(5) 

where L s (uj) is a measurable version of a Lipschitz constant of <f>(s , u> , •) and Wj is the 
quadratic Wasserstein distance in )• 

Proof. Since qt(uj, da) has a density for almost every (t, 10), assumption H(q t (uj, da), q t (u, da),c) 
is satisfied. We can therefore apply Theorem [LT] to (E, S, m) = (ClxR + ,Vred, F(du)dkt(u)), 
where Vred is the predictable a— field with respect to Tt- Then, there exists a predictable 
mapping T : Cl x M + x M. d :— > M. d that for m-almost every (t,ui) pushes forward qt to qt- 
Moreover, for a.e. (t,iv), one has 

1 - T(t,us,a)\ 2 q t (u),da) = W%(q t ,q t )- 

On can thus define a martingale measure M by the stochastic integrals 

/ ip(s,a)M(ds,da) := / f ip(s,T(s,a))M(ds,da) 
J Jo J 
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for predictable simple functions ip. Its covariance measure is by construction qt(da)dkt, and 
by Doob's inequality, the left hand side of © is less than 



E(jf J \<j>(s,a) -0(s,T(s,a))\ 2 q s {da)dk s )j < E Qf L\(J \a - T{s,a)\ 2 q s {da^j dk 
< E (J L l Wi(q s , q s ) dk s ^ , by definition of T and of Wf . 

□ 

Remark 3.2. i) The construction of strong couplings between orthogonal martingale 
measures arises classically in the literature, especially in cases where the martingale 
measure M is a compensated Poisson point measure or a space-time white noise, 
for which the covariance measures are deterministic (cf. Grigelionis J2J/ ; El Karoui- 
Lepeltier Tanaka fliy . El Karoui- Meleard Jl|/, Meleard- Roelly [?]/, Guerin 
A classical approach is to use the Skorokhod representation theorem. This however 
prevents any hope to obtain quantitative estimates related to the associated covariance 
measures, what we have been able to do here thanks to the optimal transport maps. 

ii) If the probability space and the martingale measure M are not fixed in advance, a 
coupling satisfying the estimate (|5|) can be constructed from and orthogonal martin- 
gale measure M(dt,da,da') on R + x M. d x M. d with covariance measure irt(da,da')dkt, 
where Tit is an optimal transference plan between qt and qt- Then, M(dt,da,M. d ) and 
M(dt,M. d ,da') are indeed two orthogonal martingale measures with the required co- 
variances and satisfying estimate (13, The question in this situation is however 
how to construct such M . 
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